Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 21 |
| Number of observations | 198481 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 2588 |
| Duplicate rows (%) | 1.3% |
| Total size in memory | 174.4 MiB |
| Average record size in memory | 921.3 B |
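The derived figures in this table can be cross-checked from the raw counts. The sketch below is only a sanity check, not part of the profiling tool. It assumes "MiB" means 1024² bytes, and because the reported total size is already rounded to one decimal, the recomputed record size can differ from the reported 921.3 B in the last digit.

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 198_481
duplicate_rows = 2_588
total_mib = 174.4  # reported total size, already rounded to one decimal

# Share of duplicate rows among all observations.
duplicate_share = duplicate_rows / n_rows

# Average record size in bytes, assuming 1 MiB = 1024**2 bytes.
avg_record_bytes = total_mib * 1024**2 / n_rows

print(f"Duplicate rows: {duplicate_share:.1%}")
print(f"Average record size: {avg_record_bytes:.1f} B")
```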
Variable types
| Type | Count |
|---|---|
| Boolean | 3 |
| Numeric | 6 |
| Categorical | 12 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 2588 (1.3%) duplicate rows | Duplicates |
| party_sobriety is highly overall correlated with party_drug_physical | High correlation |
| party_drug_physical is highly overall correlated with party_sobriety | High correlation |
| vehicle_type is highly overall correlated with vehicle_transmission | High correlation |
| vehicle_transmission is highly overall correlated with vehicle_type | High correlation |
| direction is highly overall correlated with intersection | High correlation |
| intersection is highly overall correlated with direction and 1 other fields | High correlation |
| weather_1 is highly overall correlated with road_surface | High correlation |
| primary_collision_factor is highly overall correlated with pcf_violation_category | High correlation |
| pcf_violation_category is highly overall correlated with intersection and 1 other fields | High correlation |
| road_surface is highly overall correlated with weather_1 | High correlation |
| party_sobriety is highly imbalanced (53.9%) | Imbalance |
| party_drug_physical is highly imbalanced (64.5%) | Imbalance |
| cellphone_in_use is highly imbalanced (86.0%) | Imbalance |
| vehicle_type is highly imbalanced (53.9%) | Imbalance |
| weather_1 is highly imbalanced (68.7%) | Imbalance |
| primary_collision_factor is highly imbalanced (97.3%) | Imbalance |
| road_surface is highly imbalanced (74.6%) | Imbalance |
| road_condition_1 is highly imbalanced (91.7%) | Imbalance |
| distance is highly skewed (γ1 = 159.4162036) | Skewed |
| insurance_premium has 33043 (16.6%) zeros | Zeros |
| vehicle_age has 147442 (74.3%) zeros | Zeros |
| distance has 41751 (21.0%) zeros | Zeros |
| collision_time has 5407 (2.7%) zeros | Zeros |
Reproduction
| Item | Value |
|---|---|
| Analysis started | 2023-11-15 15:55:42.519347 |
| Analysis finished | 2023-11-15 15:56:03.679859 |
| Duration | 21.16 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
at_fault
Boolean
| Statistic | Value |
|---|---|
| Distinct | 2 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| True | 103047 | 51.9% |
| False | 95434 | 48.1% |
insurance_premium
Real number (ℝ)
| Statistic | Value |
|---|---|
| Distinct | 105 |
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.229841 |
| Minimum | 0 |
| Maximum | 105 |
| Zeros | 33043 |
| Zeros (%) | 16.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 20 |
| median | 31 |
| Q3 | 47 |
| 95-th percentile | 67 |
| Maximum | 105 |
| Range | 105 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 20.686013 |
| Coefficient of variation (CV) | 0.64182798 |
| Kurtosis | -0.49810277 |
| Mean | 32.229841 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.14408585 |
| Sum | 6397011 |
| Variance | 427.91115 |
| Monotonicity | Not monotonic |
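Several rows in the quantile and descriptive tables are simple functions of other rows. The following sketch recomputes them from the reported values for insurance_premium. It relies only on the standard definitions (mean = sum / n, variance = std², CV = std / mean, IQR = Q3 − Q1), and small differences in the last digits are expected because the inputs are rounded.

```python
# Figures copied from the insurance_premium tables above.
n = 198_481          # number of observations
total = 6_397_011    # reported Sum
std = 20.686013      # reported standard deviation
q1, q3 = 20, 47      # reported quartiles

mean = total / n     # Mean = Sum / number of observations
variance = std ** 2  # Variance = squared standard deviation
cv = std / mean      # Coefficient of variation = std / mean
iqr = q3 - q1        # Interquartile range = Q3 - Q1

print(f"mean={mean:.6f} variance={variance:.3f} cv={cv:.6f} iqr={iqr}")
```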
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 33043 | 16.6% |
| 21 | 5642 | 2.8% |
| 20 | 5482 | 2.8% |
| 19 | 5389 | 2.7% |
| 22 | 5335 | 2.7% |
| 23 | 5082 | 2.6% |
| 24 | 4657 | 2.3% |
| 25 | 4528 | 2.3% |
| 18 | 4467 | 2.3% |
| 26 | 4334 | 2.2% |
| Other values (95) | 120522 | 60.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 33043 | 16.6% |
| 1 | 10 | < 0.1% |
| 2 | 12 | < 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 15 | < 0.1% |
| 5 | 24 | < 0.1% |
| 6 | 18 | < 0.1% |
| 7 | 31 | < 0.1% |
| 8 | 22 | < 0.1% |
| 9 | 27 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 105 | 1 | < 0.1% |
| 104 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 101 | 3 | < 0.1% |
| 100 | 3 | < 0.1% |
| 99 | 7 | < 0.1% |
| 98 | 5 | < 0.1% |
| 97 | 8 | < 0.1% |
| 96 | 15 | < 0.1% |
| 95 | 21 | < 0.1% |
party_sobriety
Categorical
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 6 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 38 |
| Median length | 21 |
| Mean length | 21.141097 |
| Min length | 14 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 4196106 |
| Distinct characters | 21 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | had not been drinking |
| 2nd row | had not been drinking |
| 3rd row | had not been drinking |
| 4th row | had not been drinking |
| 5th row | had not been drinking |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| had not been drinking | 153342 | 77.3% |
| impairment unknown | 18713 | 9.4% |
| not applicable | 13458 | 6.8% |
| had been drinking, under influence | 10139 | 5.1% |
| had been drinking, impairment unknown | 1550 | 0.8% |
| had been drinking, not under influence | 1279 | 0.6% |
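The length and character statistics for party_sobriety follow directly from these value counts. As a minimal sketch, assuming lengths are plain character counts of each category label, the totals can be recomputed as follows.

```python
# Value counts copied from the party_sobriety "Common Values" table.
value_counts = {
    "had not been drinking": 153_342,
    "impairment unknown": 18_713,
    "not applicable": 13_458,
    "had been drinking, under influence": 10_139,
    "had been drinking, impairment unknown": 1_550,
    "had been drinking, not under influence": 1_279,
}

# Number of observations is the sum of all category counts.
n_rows = sum(value_counts.values())

# Total characters: each label's length weighted by how often it occurs.
total_chars = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_chars / n_rows

# Min and max length are taken over the distinct labels.
lengths = [len(value) for value in value_counts]

print(n_rows, total_chars, round(mean_length, 6), min(lengths), max(lengths))
```

The recomputed total (4196106 characters over 198481 rows) agrees with the "Total characters" and "Mean length" rows reported for this variable.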
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| not | 168079 | 22.6% |
| had | 166310 | 22.4% |
| been | 166310 | 22.4% |
| drinking | 166310 | 22.4% |
| impairment | 20263 | 2.7% |
| unknown | 20263 | 2.7% |
| applicable | 13458 | 1.8% |
| under | 11418 | 1.5% |
| influence | 11418 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 782315 | |
| (space) | 545348 | 13.0% |
| e | 400595 | |
| i | 398022 | |
| d | 344038 | |
| a | 213489 | 5.1% |
| r | 197991 | 4.7% |
| o | 188342 | 4.5% |
| t | 188342 | 4.5% |
| k | 186573 | 4.4% |
| Other values (11) | 751051 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3637790 | |
| Space Separator | 545348 | 13.0% |
| Other Punctuation | 12968 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 782315 | |
| e | 400595 | |
| i | 398022 | |
| d | 344038 | |
| a | 213489 | 5.9% |
| r | 197991 | 5.4% |
| o | 188342 | 5.2% |
| t | 188342 | 5.2% |
| k | 186573 | 5.1% |
| b | 179768 | 4.9% |
| Other values (9) | 558315 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 545348 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 12968 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3637790 | |
| Common | 558316 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 782315 | |
| e | 400595 | |
| i | 398022 | |
| d | 344038 | |
| a | 213489 | 5.9% |
| r | 197991 | 5.4% |
| o | 188342 | 5.2% |
| t | 188342 | 5.2% |
| k | 186573 | 5.1% |
| b | 179768 | 4.9% |
| Other values (9) | 558315 |
Common
| Value | Count | Frequency (%) |
| (space) | 545348 | 97.7% |
| , | 12968 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4196106 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 782315 | |
| (space) | 545348 | 13.0% |
| e | 400595 | |
| i | 398022 | |
| d | 344038 | |
| a | 213489 | 5.1% |
| r | 197991 | 4.7% |
| o | 188342 | 4.5% |
| t | 188342 | 4.5% |
| k | 186573 | 4.4% |
| Other values (11) | 751051 |
party_drug_physical
Categorical
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 6 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.8 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 21 |
| Median length | 8 |
| Mean length | 7.8870673 |
| Min length | 1 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 1565433 |
| Distinct characters | 23 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | no drugs |
| 2nd row | no drugs |
| 3rd row | no drugs |
| 4th row | no drugs |
| 5th row | no drugs |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| no drugs | 163558 | 82.4% |
| G | 18713 | 9.4% |
| not applicable | 13458 | 6.8% |
| under drug influence | 1516 | 0.8% |
| sleepy/fatigued | 1072 | 0.5% |
| impairment - physical | 164 | 0.1% |
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| no | 163558 | 43.2% |
| drugs | 163558 | 43.2% |
| g | 18713 | 4.9% |
| not | 13458 | 3.6% |
| applicable | 13458 | 3.6% |
| under | 1516 | 0.4% |
| drug | 1516 | 0.4% |
| influence | 1516 | 0.4% |
| sleepy/fatigued | 1072 | 0.3% |
| impairment | 164 | < 0.1% |
| Other values (2) | 328 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 181728 | |
| (space) | 180376 | 11.5% |
| o | 177016 | |
| u | 169178 | |
| d | 167662 | |
| r | 166754 | |
| g | 166146 | |
| s | 164794 | |
| l | 29668 | 1.9% |
| a | 28316 | 1.8% |
| Other values (13) | 133795 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1365108 | |
| Space Separator | 180376 | 11.5% |
| Uppercase Letter | 18713 | 1.2% |
| Other Punctuation | 1072 | 0.1% |
| Dash Punctuation | 164 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 181728 | |
| o | 177016 | |
| u | 169178 | |
| d | 167662 | |
| r | 166754 | |
| g | 166146 | |
| s | 164794 | |
| l | 29668 | 2.2% |
| a | 28316 | 2.1% |
| p | 28316 | 2.1% |
| Other values (9) | 85530 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 180376 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 18713 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1072 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1383821 | |
| Common | 181612 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 181728 | |
| o | 177016 | |
| u | 169178 | |
| d | 167662 | |
| r | 166754 | |
| g | 166146 | |
| s | 164794 | |
| l | 29668 | 2.1% |
| a | 28316 | 2.0% |
| p | 28316 | 2.0% |
| Other values (10) | 104243 |
Common
| Value | Count | Frequency (%) |
| (space) | 180376 | 99.3% |
| / | 1072 | 0.6% |
| - | 164 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1565433 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 181728 | |
| (space) | 180376 | 11.5% |
| o | 177016 | |
| u | 169178 | |
| d | 167662 | |
| r | 166754 | |
| g | 166146 | |
| s | 164794 | |
| l | 29668 | 1.9% |
| a | 28316 | 1.8% |
| Other values (13) | 133795 |
cellphone_in_use
Boolean
| Statistic | Value |
|---|---|
| Distinct | 2 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| False | 194552 | 98.0% |
| True | 3929 | 2.0% |
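The "Imbalance" percentages in the alerts are consistent with a score of one minus the normalized Shannon entropy of the class shares. The sketch below works under that assumption: the helper `imbalance_score` is a hypothetical name, not the profiling library's API. With the counts reported here it gives 86.0% for cellphone_in_use and 97.3% for primary_collision_factor, matching the alerts.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no meaningful imbalance
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts copied from the value tables of the two variables.
cellphone = imbalance_score([194_552, 3_929])
pcf = imbalance_score([197_551, 925, 5])

print(f"cellphone_in_use: {cellphone:.1%}")
print(f"primary_collision_factor: {pcf:.1%}")
```

A score of 0% corresponds to perfectly balanced classes and values near 100% to a single dominant class, which is why a variable with a 98.0% majority class scores well above the alert threshold.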
vehicle_type
Categorical
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 6 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.5 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 14 |
| Median length | 14 |
| Mean length | 11.519108 |
| Min length | 5 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 2286324 |
| Distinct characters | 19 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | not applicable |
| 2nd row | not applicable |
| 3rd row | not applicable |
| 4th row | not applicable |
| 5th row | not applicable |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| not applicable | 142865 | 72.0% |
| sedan | 34928 | 17.6% |
| coupe | 18164 | 9.2% |
| hatchback | 1579 | 0.8% |
| minivan | 909 | 0.5% |
| other | 36 | < 0.1% |
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| not | 142865 | 41.9% |
| applicable | 142865 | 41.9% |
| sedan | 34928 | 10.2% |
| coupe | 18164 | 5.3% |
| hatchback | 1579 | 0.5% |
| minivan | 909 | 0.3% |
| other | 36 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 324725 | |
| p | 303894 | |
| l | 285730 | |
| e | 195993 | |
| n | 179611 | |
| c | 164187 | |
| o | 161065 | |
| i | 144683 | |
| t | 144480 | |
| b | 144444 | |
| Other values (9) | 237512 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2143459 | |
| Space Separator | 142865 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 324725 | |
| p | 303894 | |
| l | 285730 | |
| e | 195993 | |
| n | 179611 | |
| c | 164187 | |
| o | 161065 | |
| i | 144683 | |
| t | 144480 | |
| b | 144444 | |
| Other values (8) | 94647 | 4.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 142865 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2143459 | |
| Common | 142865 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 324725 | |
| p | 303894 | |
| l | 285730 | |
| e | 195993 | |
| n | 179611 | |
| c | 164187 | |
| o | 161065 | |
| i | 144683 | |
| t | 144480 | |
| b | 144444 | |
| Other values (8) | 94647 | 4.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 142865 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2286324 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 324725 | |
| p | 303894 | |
| l | 285730 | |
| e | 195993 | |
| n | 179611 | |
| c | 164187 | |
| o | 161065 | |
| i | 144683 | |
| t | 144480 | |
| b | 144444 | |
| Other values (9) | 237512 |
vehicle_transmission
Categorical
| Statistic | Value |
|---|---|
| Distinct | 3 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.5 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 14 |
| Median length | 14 |
| Mean length | 11.537044 |
| Min length | 4 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 2289884 |
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | not applicable |
| 2nd row | not applicable |
| 3rd row | not applicable |
| 4th row | not applicable |
| 5th row | not applicable |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| not applicable | 143719 | 72.4% |
| manual | 29385 | 14.8% |
| auto | 25377 | 12.8% |
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| not | 143719 | 42.0% |
| applicable | 143719 | 42.0% |
| manual | 29385 | 8.6% |
| auto | 25377 | 7.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 371585 | |
| l | 316823 | |
| p | 287438 | |
| n | 173104 | |
| o | 169096 | |
| t | 169096 | |
| (space) | 143719 | 6.3% |
| i | 143719 | 6.3% |
| c | 143719 | 6.3% |
| b | 143719 | 6.3% |
| Other values (3) | 227866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2146165 | |
| Space Separator | 143719 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 371585 | |
| l | 316823 | |
| p | 287438 | |
| n | 173104 | |
| o | 169096 | |
| t | 169096 | |
| i | 143719 | 6.7% |
| c | 143719 | 6.7% |
| b | 143719 | 6.7% |
| e | 143719 | 6.7% |
| Other values (2) | 84147 | 3.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 143719 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2146165 | |
| Common | 143719 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 371585 | |
| l | 316823 | |
| p | 287438 | |
| n | 173104 | |
| o | 169096 | |
| t | 169096 | |
| i | 143719 | 6.7% |
| c | 143719 | 6.7% |
| b | 143719 | 6.7% |
| e | 143719 | 6.7% |
| Other values (2) | 84147 | 3.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 143719 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2289884 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 371585 | |
| l | 316823 | |
| p | 287438 | |
| n | 173104 | |
| o | 169096 | |
| t | 169096 | |
| (space) | 143719 | 6.3% |
| i | 143719 | 6.3% |
| c | 143719 | 6.3% |
| b | 143719 | 6.3% |
| Other values (3) | 227866 |
vehicle_age
Real number (ℝ)
| Statistic | Value |
|---|---|
| Distinct | 20 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3267567 |
| Minimum | 0 |
| Maximum | 161 |
| Zeros | 147442 |
| Zeros (%) | 74.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 8 |
| Maximum | 161 |
| Range | 161 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 2.7494633 |
| Coefficient of variation (CV) | 2.0723191 |
| Kurtosis | 118.51067 |
| Mean | 1.3267567 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.0748891 |
| Sum | 263336 |
| Variance | 7.5595485 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 147442 | 74.3% |
| 3 | 10883 | 5.5% |
| 4 | 7077 | 3.6% |
| 2 | 6082 | 3.1% |
| 5 | 5461 | 2.8% |
| 6 | 3927 | 2.0% |
| 7 | 3826 | 1.9% |
| 8 | 3500 | 1.8% |
| 9 | 2779 | 1.4% |
| 1 | 2440 | 1.2% |
| Other values (10) | 5064 | 2.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 147442 | 74.3% |
| 1 | 2440 | 1.2% |
| 2 | 6082 | 3.1% |
| 3 | 10883 | 5.5% |
| 4 | 7077 | 3.6% |
| 5 | 5461 | 2.8% |
| 6 | 3927 | 2.0% |
| 7 | 3826 | 1.9% |
| 8 | 3500 | 1.8% |
| 9 | 2779 | 1.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 161 | 2 | < 0.1% |
| 19 | 1 | < 0.1% |
| 17 | 3 | < 0.1% |
| 16 | 7 | < 0.1% |
| 15 | 41 | < 0.1% |
| 14 | 284 | 0.1% |
| 13 | 558 | 0.3% |
| 12 | 863 | 0.4% |
| 11 | 1360 | 0.7% |
| 10 | 1945 | 1.0% |
county_city_location
Real number (ℝ)
| Statistic | Value |
|---|---|
| Distinct | 509 |
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2772.833 |
| Minimum | 100 |
| Maximum | 5802 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 100 |
| 5-th percentile | 500 |
| Q1 | 1941 |
| median | 3000 |
| Q3 | 3700 |
| 95-th percentile | 5200 |
| Maximum | 5802 |
| Range | 5702 |
| Interquartile range (IQR) | 1759 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 1306.9054 |
| Coefficient of variation (CV) | 0.47132497 |
| Kurtosis | -0.37459915 |
| Mean | 2772.833 |
| Median Absolute Deviation (MAD) | 1058 |
| Skewness | 0.15853246 |
| Sum | 5.5035466 × 10^8 |
| Variance | 1708001.8 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1942 | 24276 | 12.2% |
| 1900 | 7735 | 3.9% |
| 3400 | 3716 | 1.9% |
| 3711 | 3347 | 1.7% |
| 1941 | 3137 | 1.6% |
| 4313 | 2981 | 1.5% |
| 1500 | 2811 | 1.4% |
| 109 | 2650 | 1.3% |
| 3001 | 2627 | 1.3% |
| 3300 | 2470 | 1.2% |
| Other values (499) | 142731 | 71.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 1332 | 0.7% |
| 101 | 410 | 0.2% |
| 102 | 60 | < 0.1% |
| 103 | 551 | 0.3% |
| 104 | 240 | 0.1% |
| 105 | 785 | 0.4% |
| 106 | 749 | 0.4% |
| 107 | 517 | 0.3% |
| 108 | 168 | 0.1% |
| 109 | 2650 | 1.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5802 | 8 | < 0.1% |
| 5801 | 57 | < 0.1% |
| 5800 | 240 | 0.1% |
| 5704 | 388 | 0.2% |
| 5703 | 324 | 0.2% |
| 5702 | 12 | < 0.1% |
| 5701 | 117 | 0.1% |
| 5700 | 257 | 0.1% |
| 5690 | 205 | 0.1% |
| 5609 | 398 | 0.2% |
distance
Real number (ℝ)
SKEWED  ZEROS 
| Statistic | Value |
|---|---|
| Distinct | 2242 |
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 643.38082 |
| Minimum | 0 |
| Maximum | 1584000 |
| Zeros | 41751 |
| Zeros (%) | 21.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 14 |
| median | 100 |
| Q3 | 500 |
| 95-th percentile | 2640 |
| Maximum | 1584000 |
| Range | 1584000 |
| Interquartile range (IQR) | 486 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 8205.8205 |
| Coefficient of variation (CV) | 12.75422 |
| Kurtosis | 29129.625 |
| Mean | 643.38082 |
| Median Absolute Deviation (MAD) | 100 |
| Skewness | 159.4162 |
| Sum | 1.2769887 × 10^8 |
| Variance | 67335490 |
| Monotonicity | Not monotonic |
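One way to see how heavy the right tail of distance is: compare the reported quantiles against Tukey's 1.5 × IQR fences. This is a common rule of thumb brought in here for illustration. It is not something the report itself computes, and the 1.5 multiplier is a convention rather than a property of the data.

```python
# Quartiles copied from the "Quantile statistics" table for distance.
q1, q3 = 14, 500
p95, maximum = 2640, 1_584_000

# Tukey fences: values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR] are flagged.
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(f"IQR = {iqr}, fences = [{lower_fence}, {upper_fence}]")
print("95th percentile beyond upper fence:", p95 > upper_fence)
print("maximum beyond upper fence:", maximum > upper_fence)
```

Under this rule more than 5% of the observations lie above the upper fence, which is in line with the very large skewness and kurtosis reported for this variable.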
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 41751 | 21.0% |
| 100 | 8117 | 4.1% |
| 200 | 6698 | 3.4% |
| 50 | 5628 | 2.8% |
| 300 | 5594 | 2.8% |
| 500 | 5086 | 2.6% |
| 528 | 4786 | 2.4% |
| 1056 | 4539 | 2.3% |
| 150 | 3484 | 1.8% |
| 20 | 3423 | 1.7% |
| Other values (2232) | 109375 | 55.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 41751 | 21.0% |
| 1 | 251 | 0.1% |
| 1.1 | 7 | < 0.1% |
| 1.17 | 2 | < 0.1% |
| 1.2 | 3 | < 0.1% |
| 1.25 | 2 | < 0.1% |
| 1.3 | 4 | < 0.1% |
| 1.33 | 1 | < 0.1% |
| 1.4 | 5 | < 0.1% |
| 1.5 | 15 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1584000 | 4 | < 0.1% |
| 792000 | 2 | < 0.1% |
| 549120 | 2 | < 0.1% |
| 528000 | 1 | < 0.1% |
| 316800 | 2 | < 0.1% |
| 264000 | 2 | < 0.1% |
| 171600 | 1 | < 0.1% |
| 132000 | 5 | < 0.1% |
| 124080 | 1 | < 0.1% |
| 81312 | 1 | < 0.1% |
direction
Categorical
| Statistic | Value |
|---|---|
| Distinct | 5 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.3 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 7 |
| Median length | 5 |
| Mean length | 5.0605549 |
| Min length | 4 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 1004424 |
| Distinct characters | 11 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | north |
| 2nd row | north |
| 3rd row | unknown |
| 4th row | south |
| 5th row | north |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| north | 43646 | 22.0% |
| south | 43491 | 21.9% |
| unknown | 41121 | 20.7% |
| west | 35255 | 17.8% |
| east | 34968 | 17.6% |
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| north | 43646 | 22.0% |
| south | 43491 | 21.9% |
| unknown | 41121 | 20.7% |
| west | 35255 | 17.8% |
| east | 34968 | 17.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 167009 | |
| t | 157360 | |
| o | 128258 | |
| s | 113714 | |
| h | 87137 | |
| u | 84612 | |
| w | 76376 | |
| e | 70223 | |
| r | 43646 | 4.3% |
| k | 41121 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1004424 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 167009 | |
| t | 157360 | |
| o | 128258 | |
| s | 113714 | |
| h | 87137 | |
| u | 84612 | |
| w | 76376 | |
| e | 70223 | |
| r | 43646 | 4.3% |
| k | 41121 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1004424 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 167009 | |
| t | 157360 | |
| o | 128258 | |
| s | 113714 | |
| h | 87137 | |
| u | 84612 | |
| w | 76376 | |
| e | 70223 | |
| r | 43646 | 4.3% |
| k | 41121 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1004424 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 167009 | |
| t | 157360 | |
| o | 128258 | |
| s | 113714 | |
| h | 87137 | |
| u | 84612 | |
| w | 76376 | |
| e | 70223 | |
| r | 43646 | 4.3% |
| k | 41121 | 4.1% |
intersection
Boolean
| Statistic | Value |
|---|---|
| Distinct | 2 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| False | 160887 | 81.1% |
| True | 37594 | 18.9% |
weather_1
Categorical
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 8 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.3 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 7 |
| Median length | 5 |
| Mean length | 5.2363853 |
| Min length | 3 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 1039323 |
| Distinct characters | 18 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | clear |
| 2nd row | clear |
| 3rd row | clear |
| 4th row | clear |
| 5th row | clear |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| clear | 158783 | 80.0% |
| cloudy | 29633 | 14.9% |
| raining | 8267 | 4.2% |
| fog | 603 | 0.3% |
| unknown | 556 | 0.3% |
| snowing | 440 | 0.2% |
| other | 164 | 0.1% |
| wind | 35 | < 0.1% |
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| clear | 158783 | 80.0% |
| cloudy | 29633 | 14.9% |
| raining | 8267 | 4.2% |
| fog | 603 | 0.3% |
| unknown | 556 | 0.3% |
| snowing | 440 | 0.2% |
| other | 164 | 0.1% |
| wind | 35 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 188416 | |
| l | 188416 | |
| r | 167214 | |
| a | 167050 | |
| e | 158947 | |
| o | 31396 | 3.0% |
| u | 30189 | 2.9% |
| d | 29668 | 2.9% |
| y | 29633 | 2.9% |
| n | 19117 | 1.8% |
| Other values (8) | 29277 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1039323 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 188416 | |
| l | 188416 | |
| r | 167214 | |
| a | 167050 | |
| e | 158947 | |
| o | 31396 | 3.0% |
| u | 30189 | 2.9% |
| d | 29668 | 2.9% |
| y | 29633 | 2.9% |
| n | 19117 | 1.8% |
| Other values (8) | 29277 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1039323 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 188416 | |
| l | 188416 | |
| r | 167214 | |
| a | 167050 | |
| e | 158947 | |
| o | 31396 | 3.0% |
| u | 30189 | 2.9% |
| d | 29668 | 2.9% |
| y | 29633 | 2.9% |
| n | 19117 | 1.8% |
| Other values (8) | 29277 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1039323 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 188416 | |
| l | 188416 | |
| r | 167214 | |
| a | 167050 | |
| e | 158947 | |
| o | 31396 | 3.0% |
| u | 30189 | 2.9% |
| d | 29668 | 2.9% |
| y | 29633 | 2.9% |
| n | 19117 | 1.8% |
| Other values (8) | 29277 | 2.8% |
location_type
Categorical
| Statistic | Value |
|---|---|
| Distinct | 4 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.3 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 12 |
| Median length | 4 |
| Mean length | 5.1862143 |
| Min length | 4 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 1029365 |
| Distinct characters | 16 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | road |
| 2nd row | road |
| 3rd row | road |
| 4th row | road |
| 5th row | road |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| road | 115303 | 58.1% |
| highway | 68547 | 34.5% |
| ramp | 10906 | 5.5% |
| intersection | 3725 | 1.9% |
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| road | 115303 | 58.1% |
| highway | 68547 | 34.5% |
| ramp | 10906 | 5.5% |
| intersection | 3725 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 194756 | |
| h | 137094 | |
| r | 129934 | |
| o | 119028 | |
| d | 115303 | |
| i | 75997 | 7.4% |
| g | 68547 | 6.7% |
| w | 68547 | 6.7% |
| y | 68547 | 6.7% |
| m | 10906 | 1.1% |
| Other values (6) | 40706 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1029365 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 194756 | |
| h | 137094 | |
| r | 129934 | |
| o | 119028 | |
| d | 115303 | |
| i | 75997 | 7.4% |
| g | 68547 | 6.7% |
| w | 68547 | 6.7% |
| y | 68547 | 6.7% |
| m | 10906 | 1.1% |
| Other values (6) | 40706 | 4.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1029365 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 194756 | |
| h | 137094 | |
| r | 129934 | |
| o | 119028 | |
| d | 115303 | |
| i | 75997 | 7.4% |
| g | 68547 | 6.7% |
| w | 68547 | 6.7% |
| y | 68547 | 6.7% |
| m | 10906 | 1.1% |
| Other values (6) | 40706 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1029365 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 194756 | |
| h | 137094 | |
| r | 129934 | |
| o | 119028 | |
| d | 115303 | |
| i | 75997 | 7.4% |
| g | 68547 | 6.7% |
| w | 68547 | 6.7% |
| y | 68547 | 6.7% |
| m | 10906 | 1.1% |
| Other values (6) | 40706 | 4.0% |
primary_collision_factor
Categorical
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 3 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 22 |
| Median length | 22 |
| Mean length | 21.999723 |
| Min length | 11 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 4366527 |
| Distinct characters | 18 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | vehicle code violation |
| 2nd row | vehicle code violation |
| 3rd row | vehicle code violation |
| 4th row | vehicle code violation |
| 5th row | vehicle code violation |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| vehicle code violation | 197551 | 99.5% |
| other improper driving | 925 | 0.5% |
| fell asleep | 5 | < 0.1% |
[Plots omitted: Length; Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| vehicle | 197551 | 33.2% |
| code | 197551 | 33.2% |
| violation | 197551 | 33.2% |
| other | 925 | 0.2% |
| improper | 925 | 0.2% |
| driving | 925 | 0.2% |
| fell | 5 | < 0.1% |
| asleep | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 595428 | |
| e | 594518 | |
| o | 594503 | |
| (space) | 396957 | 9.1% |
| v | 396027 | |
| l | 395117 | |
| c | 395102 | |
| n | 198476 | 4.5% |
| h | 198476 | 4.5% |
| d | 198476 | 4.5% |
| Other values (8) | 403447 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3969570 | |
| Space Separator | 396957 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 595428 | |
| e | 594518 | |
| o | 594503 | |
| v | 396027 | |
| l | 395117 | |
| c | 395102 | |
| n | 198476 | 5.0% |
| h | 198476 | 5.0% |
| d | 198476 | 5.0% |
| t | 198476 | 5.0% |
| Other values (7) | 204971 | 5.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 396957 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3969570 | |
| Common | 396957 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 595428 | |
| e | 594518 | |
| o | 594503 | |
| v | 396027 | |
| l | 395117 | |
| c | 395102 | |
| n | 198476 | 5.0% |
| h | 198476 | 5.0% |
| d | 198476 | 5.0% |
| t | 198476 | 5.0% |
| Other values (7) | 204971 | 5.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 396957 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4366527 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 595428 | |
| e | 594518 | |
| o | 594503 | |
| (space) | 396957 | 9.1% |
| v | 396027 | |
| l | 395117 | |
| c | 395102 | |
| n | 198476 | 4.5% |
| h | 198476 | 4.5% |
| d | 198476 | 4.5% |
| Other values (8) | 403447 |
pcf_violation_category
Categorical
| Statistic | Value |
|---|---|
| Distinct | 21 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.9 MiB |
Length
| Statistic | Value |
|---|---|
| Max length | 26 |
| Median length | 25 |
| Mean length | 13.934618 |
| Min length | 3 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 2765757 |
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | speeding |
| 2nd row | speeding |
| 3rd row | dui |
| 4th row | improper turning |
| 5th row | speeding |
Common Values
| Value | Count | Frequency (%) |
| speeding | 71917 | 36.2% |
| improper turning | 33881 | 17.1% |
| automobile right of way | 21064 | 10.6% |
| unsafe lane change | 17668 | 8.9% |
| dui | 17141 | 8.6% |
| unsafe starting or backing | 9322 | 4.7% |
| traffic signals and signs | 8521 | 4.3% |
| following too closely | 4518 | 2.3% |
| wrong side of road | 3700 | 1.9% |
| unknown | 3313 | 1.7% |
| Other values (11) | 7436 | 3.7% |
Words
| Value | Count | Frequency (%) |
| speeding | 71917 | 17.2% |
| improper | 36819 | 8.8% |
| turning | 33881 | 8.1% |
| unsafe | 26990 | 6.5% |
| of | 26551 | 6.3% |
| right | 22851 | 5.5% |
| way | 22851 | 5.5% |
| automobile | 21064 | 5.0% |
| change | 17668 | 4.2% |
| lane | 17668 | 4.2% |
| Other values (28) | 120129 | 28.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 279894 | 10.1% |
| n | 271851 | 9.8% |
| i | 266831 | 9.6% |
| (space) | 219908 | 8.0% |
| g | 193548 | 7.0% |
| r | 173429 | 6.3% |
| a | 164765 | 6.0% |
| s | 158617 | 5.7% |
| o | 157437 | 5.7% |
| p | 150107 | 5.4% |
| Other values (15) | 729370 | 26.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2545849 | 92.0% |
| Space Separator | 219908 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 279894 | 11.0% |
| n | 271851 | 10.7% |
| i | 266831 | 10.5% |
| g | 193548 | 7.6% |
| r | 173429 | 6.8% |
| a | 164765 | 6.5% |
| s | 158617 | 6.2% |
| o | 157437 | 6.2% |
| p | 150107 | 5.9% |
| t | 116938 | 4.6% |
| Other values (14) | 612432 | 24.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 219908 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2545849 | 92.0% |
| Common | 219908 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 279894 | 11.0% |
| n | 271851 | 10.7% |
| i | 266831 | 10.5% |
| g | 193548 | 7.6% |
| r | 173429 | 6.8% |
| a | 164765 | 6.5% |
| s | 158617 | 6.2% |
| o | 157437 | 6.2% |
| p | 150107 | 5.9% |
| t | 116938 | 4.6% |
| Other values (14) | 612432 | 24.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 219908 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2765757 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 279894 | 10.1% |
| n | 271851 | 9.8% |
| i | 266831 | 9.6% |
| (space) | 219908 | 8.0% |
| g | 193548 | 7.0% |
| r | 173429 | 6.3% |
| a | 164765 | 6.0% |
| s | 158617 | 5.7% |
| o | 157437 | 5.7% |
| p | 150107 | 5.4% |
| Other values (15) | 729370 | 26.4% |
road_surface
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.9 MiB |
| Value | Count |
|---|---|
| dry | 178231 |
| wet | 19176 |
| snowy | 937 |
| slippery | 137 |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.0128929 |
| Min length | 3 |
Characters and Unicode
| Total characters | 598002 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | dry |
|---|---|
| 2nd row | dry |
| 3rd row | dry |
| 4th row | dry |
| 5th row | dry |
Common Values
| Value | Count | Frequency (%) |
| dry | 178231 | 89.8% |
| wet | 19176 | 9.7% |
| snowy | 937 | 0.5% |
| slippery | 137 | 0.1% |
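The Imbalance alert on this variable can be reproduced from the four counts above. The following is a minimal sketch, assuming the score is one minus the Shannon entropy of the value distribution divided by the logarithm of the number of distinct values. That definition is an assumption about the profiler, and the name `imbalance_score` is illustrative. Under that assumption the counts give roughly 74.6%, in line with the figure reported in the alerts.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the counts
    and k is the number of distinct values (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Counts copied from the road_surface Common Values table.
road_surface_counts = {"dry": 178231, "wet": 19176, "snowy": 937, "slippery": 137}
score = imbalance_score(list(road_surface_counts.values()))
print(f"{score:.1%}")
```

A score near 0 means the values are evenly spread; a score near 1 means one value dominates, as `dry` does here.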
Words
| Value | Count | Frequency (%) |
| dry | 178231 | 89.8% |
| wet | 19176 | 9.7% |
| snowy | 937 | 0.5% |
| slippery | 137 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 179305 | 30.0% |
| r | 178368 | 29.8% |
| d | 178231 | 29.8% |
| w | 20113 | 3.4% |
| e | 19313 | 3.2% |
| t | 19176 | 3.2% |
| s | 1074 | 0.2% |
| n | 937 | 0.2% |
| o | 937 | 0.2% |
| p | 274 | < 0.1% |
| Other values (2) | 274 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 598002 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 179305 | 30.0% |
| r | 178368 | 29.8% |
| d | 178231 | 29.8% |
| w | 20113 | 3.4% |
| e | 19313 | 3.2% |
| t | 19176 | 3.2% |
| s | 1074 | 0.2% |
| n | 937 | 0.2% |
| o | 937 | 0.2% |
| p | 274 | < 0.1% |
| Other values (2) | 274 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 598002 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 179305 | 30.0% |
| r | 178368 | 29.8% |
| d | 178231 | 29.8% |
| w | 20113 | 3.4% |
| e | 19313 | 3.2% |
| t | 19176 | 3.2% |
| s | 1074 | 0.2% |
| n | 937 | 0.2% |
| o | 937 | 0.2% |
| p | 274 | < 0.1% |
| Other values (2) | 274 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 598002 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 179305 | 30.0% |
| r | 178368 | 29.8% |
| d | 178231 | 29.8% |
| w | 20113 | 3.4% |
| e | 19313 | 3.2% |
| t | 19176 | 3.2% |
| s | 1074 | 0.2% |
| n | 937 | 0.2% |
| o | 937 | 0.2% |
| p | 274 | < 0.1% |
| Other values (2) | 274 | < 0.1% |
road_condition_1
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.5 MiB |
| Value | Count |
|---|---|
| normal | 192684 |
| construction | 3215 |
| other | 751 |
| obstruction | 636 |
| holes | 589 |
| Other values (3) | 606 |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.1260221 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1215899 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | normal |
|---|---|
| 2nd row | normal |
| 3rd row | normal |
| 4th row | normal |
| 5th row | normal |
Common Values
| Value | Count | Frequency (%) |
| normal | 192684 | 97.1% |
| construction | 3215 | 1.6% |
| other | 751 | 0.4% |
| obstruction | 636 | 0.3% |
| holes | 589 | 0.3% |
| loose material | 277 | 0.1% |
| reduced width | 223 | 0.1% |
| flooded | 106 | 0.1% |
Words
| Value | Count | Frequency (%) |
| normal | 192684 | 96.8% |
| construction | 3215 | 1.6% |
| other | 751 | 0.4% |
| obstruction | 636 | 0.3% |
| holes | 589 | 0.3% |
| loose | 277 | 0.1% |
| material | 277 | 0.1% |
| reduced | 223 | 0.1% |
| width | 223 | 0.1% |
| flooded | 106 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 202492 | 16.7% |
| n | 199750 | 16.4% |
| r | 197786 | 16.3% |
| l | 193933 | 15.9% |
| a | 193238 | 15.9% |
| m | 192961 | 15.9% |
| t | 8953 | 0.7% |
| c | 7289 | 0.6% |
| s | 4717 | 0.4% |
| i | 4351 | 0.4% |
| Other values (8) | 10429 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1215399 | > 99.9% |
| Space Separator | 500 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 202492 | 16.7% |
| n | 199750 | 16.4% |
| r | 197786 | 16.3% |
| l | 193933 | 16.0% |
| a | 193238 | 15.9% |
| m | 192961 | 15.9% |
| t | 8953 | 0.7% |
| c | 7289 | 0.6% |
| s | 4717 | 0.4% |
| i | 4351 | 0.4% |
| Other values (7) | 9929 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 500 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1215399 | > 99.9% |
| Common | 500 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 202492 | 16.7% |
| n | 199750 | 16.4% |
| r | 197786 | 16.3% |
| l | 193933 | 16.0% |
| a | 193238 | 15.9% |
| m | 192961 | 15.9% |
| t | 8953 | 0.7% |
| c | 7289 | 0.6% |
| s | 4717 | 0.4% |
| i | 4351 | 0.4% |
| Other values (7) | 9929 | 0.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 500 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1215899 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 202492 | 16.7% |
| n | 199750 | 16.4% |
| r | 197786 | 16.3% |
| l | 193933 | 15.9% |
| a | 193238 | 15.9% |
| m | 192961 | 15.9% |
| t | 8953 | 0.7% |
| c | 7289 | 0.6% |
| s | 4717 | 0.4% |
| i | 4351 | 0.4% |
| Other values (8) | 10429 | 0.9% |
lighting
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
| Value | Count |
|---|---|
| daylight | 134738 |
| dark with street lights | 42292 |
| dark with no street lights | 14232 |
| dusk or dawn | 6775 |
| dark with street lights not functioning | 444 |
Length
| Max length | 39 |
|---|---|
| Median length | 8 |
| Mean length | 12.692741 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2519268 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | dark with street lights |
|---|---|
| 2nd row | dark with street lights |
| 3rd row | daylight |
| 4th row | daylight |
| 5th row | dark with street lights |
Common Values
| Value | Count | Frequency (%) |
| daylight | 134738 | 67.9% |
| dark with street lights | 42292 | 21.3% |
| dark with no street lights | 14232 | 7.2% |
| dusk or dawn | 6775 | 3.4% |
| dark with street lights not functioning | 444 | 0.2% |
Words
| Value | Count | Frequency (%) |
| daylight | 134738 | 33.8% |
| dark | 56968 | 14.3% |
| with | 56968 | 14.3% |
| street | 56968 | 14.3% |
| lights | 56968 | 14.3% |
| no | 14232 | 3.6% |
| dusk | 6775 | 1.7% |
| or | 6775 | 1.7% |
| dawn | 6775 | 1.7% |
| not | 444 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 363498 | 14.4% |
| i | 249562 | 9.9% |
| h | 248674 | 9.9% |
| d | 205256 | 8.1% |
| (space) | 199574 | 7.9% |
| a | 198481 | 7.9% |
| g | 192150 | 7.6% |
| l | 191706 | 7.6% |
| y | 134738 | 5.3% |
| s | 120711 | 4.8% |
| Other values (9) | 414918 | 16.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2319694 | 92.1% |
| Space Separator | 199574 | 7.9% |
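The character and category totals in this section follow directly from the value counts. As a sketch of that derivation (not the profiler's own code; the variable names are illustrative), each distinct string's characters can be weighted by how often the string occurs and then grouped by Unicode general category, where `Ll` is Lowercase Letter and `Zs` is Space Separator:

```python
import unicodedata
from collections import Counter

# Counts copied from the lighting Common Values table.
lighting_counts = {
    "daylight": 134738,
    "dark with street lights": 42292,
    "dark with no street lights": 14232,
    "dusk or dawn": 6775,
    "dark with street lights not functioning": 444,
}

# Weight each character of a value by the number of rows holding that value.
char_counts = Counter()
for value, n_rows in lighting_counts.items():
    for ch, per_value in Counter(value).items():
        char_counts[ch] += per_value * n_rows

# Group characters by Unicode general category.
category_counts = Counter()
for ch, n in char_counts.items():
    category_counts[unicodedata.category(ch)] += n

print(category_counts["Ll"], category_counts["Zs"])  # 2319694 199574
```

The two totals match the Lowercase Letter and Space Separator rows above, and their sum equals the 2519268 total characters reported for this variable.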
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 363498 | 15.7% |
| i | 249562 | 10.8% |
| h | 248674 | 10.7% |
| d | 205256 | 8.8% |
| a | 198481 | 8.6% |
| g | 192150 | 8.3% |
| l | 191706 | 8.3% |
| y | 134738 | 5.8% |
| s | 120711 | 5.2% |
| r | 120711 | 5.2% |
| Other values (8) | 294207 | 12.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 199574 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2319694 | 92.1% |
| Common | 199574 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 363498 | 15.7% |
| i | 249562 | 10.8% |
| h | 248674 | 10.7% |
| d | 205256 | 8.8% |
| a | 198481 | 8.6% |
| g | 192150 | 8.3% |
| l | 191706 | 8.3% |
| y | 134738 | 5.8% |
| s | 120711 | 5.2% |
| r | 120711 | 5.2% |
| Other values (8) | 294207 | 12.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 199574 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2519268 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 363498 | 14.4% |
| i | 249562 | 9.9% |
| h | 248674 | 9.9% |
| d | 205256 | 8.1% |
| (space) | 199574 | 7.9% |
| a | 198481 | 7.9% |
| g | 192150 | 7.6% |
| l | 191706 | 7.6% |
| y | 134738 | 5.3% |
| s | 120711 | 4.8% |
| Other values (9) | 414918 | 16.5% |
collision_time
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.842882 |
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 5407 |
| Zeros (%) | 2.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 14 |
| Q3 | 17 |
| 95-th percentile | 21 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 5.6884002 |
|---|---|
| Coefficient of variation (CV) | 0.44292241 |
| Kurtosis | -0.51959289 |
| Mean | 12.842882 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.39087277 |
| Sum | 2549068 |
| Variance | 32.357896 |
| Monotonicity | Not monotonic |
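Several of the figures above are simple functions of one another, which gives a quick consistency check on the table. The sketch below only uses values copied from this section; the variable names are illustrative, and small differences come from the rounding of the printed figures.

```python
# Values copied from the collision_time statistics above.
mean = 12.842882
std = 5.6884002
q1, q3 = 9, 17
minimum, maximum = 0, 23
n_rows, total = 198481, 2549068

variance = std ** 2            # reported as 32.357896
cv = std / mean                # coefficient of variation, reported as 0.44292241
iqr = q3 - q1                  # interquartile range, reported as 8
value_range = maximum - minimum  # reported as 23
mean_from_sum = total / n_rows   # Sum divided by the number of observations

print(round(variance, 4), round(cv, 5), iqr, value_range, round(mean_from_sum, 4))
```

Because `collision_time` takes only the 24 integer values 0 to 23, it behaves as an hour of day, so the mean and standard deviation summarize where collisions cluster within the day.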
Common Values
| Value | Count | Frequency (%) |
| 15 | 15368 | 7.7% |
| 17 | 15202 | 7.7% |
| 16 | 14290 | 7.2% |
| 18 | 12897 | 6.5% |
| 14 | 12772 | 6.4% |
| 8 | 11741 | 5.9% |
| 13 | 11161 | 5.6% |
| 7 | 11094 | 5.6% |
| 12 | 10970 | 5.5% |
| 11 | 9239 | 4.7% |
| Other values (14) | 73747 | 37.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 5407 | 2.7% |
| 1 | 3806 | 1.9% |
| 2 | 4020 | 2.0% |
| 3 | 2590 | 1.3% |
| 4 | 2007 | 1.0% |
| 5 | 3005 | 1.5% |
| 6 | 5211 | 2.6% |
| 7 | 11094 | 5.6% |
| 8 | 11741 | 5.9% |
| 9 | 8755 | 4.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 23 | 4518 | 2.3% |
| 22 | 5266 | 2.7% |
| 21 | 6117 | 3.1% |
| 20 | 6563 | 3.3% |
| 19 | 8171 | 4.1% |
| 18 | 12897 | 6.5% |
| 17 | 15202 | 7.7% |
| 16 | 14290 | 7.2% |
| 15 | 15368 | 7.7% |
| 14 | 12772 | 6.4% |
month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0637744 |
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.6234079 |
|---|---|
| Coefficient of variation (CV) | 0.52987188 |
| Kurtosis | 2.0989853 |
| Mean | 3.0637744 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.86431594 |
| Sum | 608101 |
| Variance | 2.6354532 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 3 | 41566 | 20.9% |
| 1 | 40844 | 20.6% |
| 2 | 38907 | 19.6% |
| 4 | 37395 | 18.8% |
| 5 | 32586 | 16.4% |
| 6 | 4100 | 2.1% |
| 8 | 858 | 0.4% |
| 9 | 676 | 0.3% |
| 7 | 563 | 0.3% |
| 10 | 386 | 0.2% |
| Other values (2) | 600 | 0.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 40844 | 20.6% |
| 2 | 38907 | 19.6% |
| 3 | 41566 | 20.9% |
| 4 | 37395 | 18.8% |
| 5 | 32586 | 16.4% |
| 6 | 4100 | 2.1% |
| 7 | 563 | 0.3% |
| 8 | 858 | 0.4% |
| 9 | 676 | 0.3% |
| 10 | 386 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 12 | 286 | 0.1% |
| 11 | 314 | 0.2% |
| 10 | 386 | 0.2% |
| 9 | 676 | 0.3% |
| 8 | 858 | 0.4% |
| 7 | 563 | 0.3% |
| 6 | 4100 | 2.1% |
| 5 | 32586 | 16.4% |
| 4 | 37395 | 18.8% |
| 3 | 41566 | 20.9% |
Correlations
| | insurance_premium | vehicle_age | county_city_location | distance | collision_time | month | at_fault | party_sobriety | party_drug_physical | cellphone_in_use | vehicle_type | vehicle_transmission | direction | intersection | weather_1 | location_type | primary_collision_factor | pcf_violation_category | road_surface | road_condition_1 | lighting |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| insurance_premium | 1.000 | 0.157 | 0.033 | 0.023 | 0.007 | 0.006 | 0.155 | 0.373 | 0.365 | 0.045 | 0.118 | 0.178 | 0.038 | 0.092 | 0.018 | 0.129 | 0.024 | 0.090 | 0.022 | 0.012 | 0.114 |
| vehicle_age | 0.157 | 1.000 | 0.026 | 0.031 | 0.042 | 0.044 | 0.000 | 0.000 | 0.000 | 0.000 | 0.007 | 0.006 | 0.000 | 0.003 | 0.000 | 0.003 | 0.000 | 0.000 | 0.002 | 0.000 | 0.004 |
| county_city_location | 0.033 | 0.026 | 1.000 | 0.034 | 0.008 | 0.006 | 0.091 | 0.112 | 0.102 | 0.394 | 0.098 | 0.123 | 0.197 | 0.228 | 0.161 | 0.260 | 0.081 | 0.113 | 0.262 | 0.114 | 0.151 |
| distance | 0.023 | 0.031 | 0.034 | 1.000 | -0.040 | 0.023 | 0.000 | 0.005 | 0.000 | 0.000 | 0.000 | 0.000 | 0.003 | 0.000 | 0.000 | 0.000 | 0.000 | 0.004 | 0.000 | 0.000 | 0.000 |
| collision_time | 0.007 | 0.042 | 0.008 | -0.040 | 1.000 | 0.005 | 0.073 | 0.185 | 0.114 | 0.007 | 0.043 | 0.060 | 0.032 | 0.064 | 0.052 | 0.081 | 0.011 | 0.134 | 0.042 | 0.018 | 0.448 |
| month | 0.006 | 0.044 | 0.006 | 0.023 | 0.005 | 1.000 | 0.005 | 0.021 | 0.058 | 0.012 | 0.063 | 0.073 | 0.013 | 0.025 | 0.066 | 0.022 | 0.035 | 0.032 | 0.093 | 0.015 | 0.074 |
| at_fault | 0.155 | 0.000 | 0.091 | 0.000 | 0.073 | 0.005 | 1.000 | 0.409 | 0.353 | 0.016 | 0.145 | 0.043 | 0.015 | 0.017 | 0.034 | 0.033 | 0.005 | 0.122 | 0.046 | 0.015 | 0.068 |
| party_sobriety | 0.373 | 0.000 | 0.112 | 0.005 | 0.185 | 0.021 | 0.409 | 1.000 | 0.633 | 0.047 | 0.117 | 0.167 | 0.033 | 0.083 | 0.019 | 0.135 | 0.024 | 0.330 | 0.016 | 0.016 | 0.169 |
| party_drug_physical | 0.365 | 0.000 | 0.102 | 0.000 | 0.114 | 0.058 | 0.353 | 0.633 | 1.000 | 0.039 | 0.108 | 0.158 | 0.031 | 0.079 | 0.018 | 0.126 | 0.024 | 0.155 | 0.017 | 0.016 | 0.103 |
| cellphone_in_use | 0.045 | 0.000 | 0.394 | 0.000 | 0.007 | 0.012 | 0.016 | 0.047 | 0.039 | 1.000 | 0.007 | 0.006 | 0.011 | 0.008 | 0.011 | 0.017 | 0.002 | 0.022 | 0.019 | 0.005 | 0.002 |
| vehicle_type | 0.118 | 0.007 | 0.098 | 0.000 | 0.043 | 0.063 | 0.145 | 0.117 | 0.108 | 0.007 | 1.000 | 0.709 | 0.042 | 0.090 | 0.013 | 0.062 | 0.008 | 0.239 | 0.016 | 0.011 | 0.034 |
| vehicle_transmission | 0.178 | 0.006 | 0.123 | 0.000 | 0.060 | 0.073 | 0.043 | 0.167 | 0.158 | 0.006 | 0.709 | 1.000 | 0.029 | 0.053 | 0.010 | 0.029 | 0.008 | 0.098 | 0.011 | 0.011 | 0.038 |
| direction | 0.038 | 0.000 | 0.197 | 0.003 | 0.032 | 0.013 | 0.015 | 0.033 | 0.031 | 0.011 | 0.042 | 0.029 | 1.000 | 0.941 | 0.023 | 0.227 | 0.008 | 0.279 | 0.022 | 0.027 | 0.050 |
| intersection | 0.092 | 0.003 | 0.228 | 0.000 | 0.064 | 0.025 | 0.017 | 0.083 | 0.079 | 0.008 | 0.090 | 0.053 | 0.941 | 1.000 | 0.031 | 0.373 | 0.009 | 0.577 | 0.026 | 0.043 | 0.094 |
| weather_1 | 0.018 | 0.000 | 0.161 | 0.000 | 0.052 | 0.066 | 0.034 | 0.019 | 0.018 | 0.011 | 0.013 | 0.010 | 0.023 | 0.031 | 1.000 | 0.044 | 0.009 | 0.032 | 0.551 | 0.044 | 0.040 |
| location_type | 0.129 | 0.003 | 0.260 | 0.000 | 0.081 | 0.022 | 0.033 | 0.135 | 0.126 | 0.017 | 0.062 | 0.029 | 0.227 | 0.373 | 0.044 | 1.000 | 0.028 | 0.280 | 0.040 | 0.064 | 0.107 |
| primary_collision_factor | 0.024 | 0.000 | 0.081 | 0.000 | 0.011 | 0.035 | 0.005 | 0.024 | 0.024 | 0.002 | 0.008 | 0.008 | 0.008 | 0.009 | 0.009 | 0.028 | 1.000 | 1.000 | 0.007 | 0.000 | 0.007 |
| pcf_violation_category | 0.090 | 0.000 | 0.113 | 0.004 | 0.134 | 0.032 | 0.122 | 0.330 | 0.155 | 0.022 | 0.239 | 0.098 | 0.279 | 0.577 | 0.032 | 0.280 | 1.000 | 1.000 | 0.056 | 0.033 | 0.163 |
| road_surface | 0.022 | 0.002 | 0.262 | 0.000 | 0.042 | 0.093 | 0.046 | 0.016 | 0.017 | 0.019 | 0.016 | 0.011 | 0.022 | 0.026 | 0.551 | 0.040 | 0.007 | 0.056 | 1.000 | 0.099 | 0.035 |
| road_condition_1 | 0.012 | 0.000 | 0.114 | 0.000 | 0.018 | 0.015 | 0.015 | 0.016 | 0.016 | 0.005 | 0.011 | 0.011 | 0.027 | 0.043 | 0.044 | 0.064 | 0.000 | 0.033 | 0.099 | 1.000 | 0.027 |
| lighting | 0.114 | 0.004 | 0.151 | 0.000 | 0.448 | 0.074 | 0.068 | 0.169 | 0.103 | 0.002 | 0.034 | 0.038 | 0.050 | 0.094 | 0.040 | 0.107 | 0.007 | 0.163 | 0.035 | 0.027 | 1.000 |
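The report does not state which coefficient fills the matrix above. Profilers of this kind commonly use Cramér's V for pairs of categorical variables, which would be consistent with almost every entry lying between 0 and 1. The sketch below assumes that measure and shows the plain, uncorrected form; the function name `cramers_v` and the toy inputs are illustrative, and a bias-corrected variant would give slightly different numbers.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equal-length categorical sequences."""
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    if n == 0 or len(row_totals) < 2 or len(col_totals) < 2:
        return 0.0  # no association can be measured with a constant variable
    joint = Counter(zip(xs, ys))
    chi2 = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (min(len(row_totals), len(col_totals)) - 1)))


# Toy data: surface fully determined by weather, then fully independent of it.
weather = ["clear", "clear", "raining", "raining"]
print(cramers_v(weather, ["dry", "dry", "wet", "wet"]))  # 1.0
print(cramers_v(weather, ["dry", "wet", "dry", "wet"]))  # 0.0
```

A value of 1.000, as between `primary_collision_factor` and `pcf_violation_category`, means one variable is fully determined by the other under this measure, so keeping both adds no information for modelling.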
First rows
| | at_fault | insurance_premium | party_sobriety | party_drug_physical | cellphone_in_use | vehicle_type | vehicle_transmission | vehicle_age | county_city_location | distance | direction | intersection | weather_1 | location_type | primary_collision_factor | pcf_violation_category | road_surface | road_condition_1 | lighting | collision_time | month |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3702 | 30.0 | north | False | clear | road | vehicle code violation | speeding | dry | normal | dark with street lights | 11 | 1 |
| 1 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3702 | 30.0 | north | False | clear | road | vehicle code violation | speeding | dry | normal | dark with street lights | 11 | 1 |
| 2 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 1961 | 0.0 | unknown | False | clear | road | vehicle code violation | dui | dry | normal | daylight | 12 | 4 |
| 3 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 4310 | 680.0 | south | False | clear | road | vehicle code violation | improper turning | dry | normal | daylight | 9 | 2 |
| 4 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3702 | 30.0 | north | False | clear | road | vehicle code violation | speeding | dry | normal | dark with street lights | 11 | 1 |
| 5 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 1941 | 70.0 | east | False | clear | road | vehicle code violation | unknown | dry | normal | dusk or dawn | 6 | 4 |
| 6 | False | 64.0 | had not been drinking | no drugs | False | hatchback | auto | 10.0 | 3711 | 289.0 | south | False | clear | road | vehicle code violation | speeding | dry | normal | daylight | 7 | 2 |
| 7 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3616 | 650.0 | east | False | raining | road | vehicle code violation | unknown | wet | normal | dark with street lights | 20 | 4 |
| 8 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 4312 | 30.0 | south | False | cloudy | road | vehicle code violation | unsafe starting or backing | dry | normal | daylight | 14 | 3 |
| 9 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 2900 | 432.0 | north | False | clear | road | vehicle code violation | speeding | dry | normal | dark with no street lights | 2 | 1 |
Last rows
| | at_fault | insurance_premium | party_sobriety | party_drug_physical | cellphone_in_use | vehicle_type | vehicle_transmission | vehicle_age | county_city_location | distance | direction | intersection | weather_1 | location_type | primary_collision_factor | pcf_violation_category | road_surface | road_condition_1 | lighting | collision_time | month |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 198471 | False | 39.0 | had not been drinking | no drugs | False | sedan | manual | 14.0 | 3026 | 1900.0 | south | False | clear | highway | vehicle code violation | unsafe lane change | dry | normal | daylight | 9 | 1 |
| 198472 | True | 50.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3313 | 1584.0 | west | False | clear | highway | vehicle code violation | speeding | dry | normal | dark with no street lights | 17 | 1 |
| 198473 | False | 23.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3313 | 1584.0 | west | False | clear | highway | vehicle code violation | speeding | dry | normal | dark with no street lights | 17 | 1 |
| 198474 | True | 27.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3026 | 50.0 | north | False | clear | highway | vehicle code violation | speeding | dry | normal | daylight | 15 | 1 |
| 198475 | False | 25.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3026 | 50.0 | north | False | clear | highway | vehicle code violation | speeding | dry | normal | daylight | 15 | 1 |
| 198476 | False | 39.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3026 | 50.0 | north | False | clear | highway | vehicle code violation | speeding | dry | normal | daylight | 15 | 1 |
| 198477 | True | 0.0 | impairment unknown | G | False | not applicable | not applicable | 0.0 | 3300 | 66.0 | west | False | clear | road | vehicle code violation | unsafe starting or backing | dry | normal | dark with street lights | 22 | 1 |
| 198478 | True | 24.0 | had been drinking, under influence | no drugs | False | not applicable | not applicable | 0.0 | 3313 | 15.0 | south | False | clear | highway | vehicle code violation | dui | dry | normal | dark with street lights | 18 | 1 |
| 198479 | False | 59.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3313 | 15.0 | south | False | clear | highway | vehicle code violation | dui | dry | normal | dark with street lights | 18 | 1 |
| 198480 | True | 27.0 | had not been drinking | no drugs | False | sedan | manual | 1.0 | 3394 | 133.0 | north | False | clear | road | vehicle code violation | unsafe starting or backing | dry | normal | daylight | 15 | 1 |
Most frequently occurring
| | at_fault | insurance_premium | party_sobriety | party_drug_physical | cellphone_in_use | vehicle_type | vehicle_transmission | vehicle_age | county_city_location | distance | direction | intersection | weather_1 | location_type | primary_collision_factor | pcf_violation_category | road_surface | road_condition_1 | lighting | collision_time | month | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 755 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 1920 | 94.0 | east | False | clear | road | vehicle code violation | dui | dry | normal | dark with street lights | 3 | 2 | 11 |
| 441 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 0109 | 0.0 | unknown | True | clear | road | vehicle code violation | improper turning | dry | normal | dusk or dawn | 6 | 4 | 8 |
| 1979 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 4203 | 159.0 | south | False | clear | road | vehicle code violation | improper turning | dry | normal | daylight | 9 | 4 | 7 |
| 291 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 3801 | 0.0 | unknown | True | clear | road | vehicle code violation | other equipment | dry | construction | dark with street lights | 1 | 8 | 6 |
| 406 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 0101 | 36.0 | west | False | raining | road | vehicle code violation | speeding | wet | normal | daylight | 9 | 2 | 6 |
| 1237 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 1942 | 251.0 | south | False | clear | road | vehicle code violation | speeding | dry | normal | daylight | 16 | 5 | 6 |
| 1282 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 1942 | 390.0 | west | False | clear | road | vehicle code violation | dui | dry | normal | dark with street lights | 3 | 4 | 6 |
| 2082 | False | 0.0 | not applicable | not applicable | False | not applicable | not applicable | 0.0 | 4905 | 48.0 | south | False | clear | road | vehicle code violation | dui | dry | normal | dark with street lights | 22 | 1 | 6 |
| 35 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 1008 | 47.0 | west | False | clear | highway | vehicle code violation | improper turning | dry | normal | daylight | 12 | 4 | 5 |
| 91 | False | 0.0 | had not been drinking | no drugs | False | not applicable | not applicable | 0.0 | 1941 | 100.0 | east | False | unknown | road | vehicle code violation | unsafe lane change | dry | normal | daylight | 10 | 4 | 5 |
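A table like the one above can be rebuilt by counting identical rows. The sketch below is one way to do that with the standard library; the function name `duplicate_summary` and the toy rows are illustrative. The report does not say whether its duplicate total counts distinct repeated patterns or the surplus copies, so the function returns both.

```python
from collections import Counter


def duplicate_summary(rows):
    """Return (n_patterns, n_extra_copies, ranked) for fully identical rows.

    n_patterns     -- number of distinct rows that occur more than once
    n_extra_copies -- rows that would be dropped if one copy of each were kept
    ranked         -- (row, count) pairs, most repeated first
    """
    counts = Counter(tuple(row) for row in rows)
    repeated = {row: c for row, c in counts.items() if c > 1}
    n_extra_copies = sum(c - 1 for c in repeated.values())
    ranked = sorted(repeated.items(), key=lambda item: item[1], reverse=True)
    return len(repeated), n_extra_copies, ranked


# Toy rows standing in for (pcf_violation_category, collision_time) pairs.
rows = [
    ("dui", 3), ("dui", 3), ("speeding", 9),
    ("dui", 3), ("improper turning", 6), ("improper turning", 6),
]
n_patterns, n_extra_copies, ranked = duplicate_summary(rows)
print(n_patterns, n_extra_copies, ranked[0])  # 2 3 (('dui', 3), 3)
```

Most rows listed above have `not applicable` or zero in the party and vehicle fields, so repeats may be distinct parties to the same collision rather than data-entry errors; that is worth checking before dropping them.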